Abstract: In this paper, we address transmit antenna selection in
multi-user MIMO systems with precoding. Optimum and reduced-complexity
sub-optimum antenna selection algorithms are introduced.
QR-decomposition (QRD) based antenna selection is investigated, and the
reason behind its sub-optimality is analytically derived. We introduce
the conventional QRD-based algorithm and propose an efficient QRD-based
transmit antenna selection scheme (maxR) that is efficient in both
implementation and performance. Moreover, we derive explicit formulae
for the computational complexities of the aforementioned algorithms.
Simulation results and analysis demonstrate that the proposed maxR
algorithm requires only 1% of the computational effort required by the
optimal algorithm, at a degradation of 1 dB and 0.1 dB in the case of
linear zero-forcing and Tomlinson-Harashima precoding schemes,
respectively.

I. Introduction 

Dirty paper coding (DPC), first proposed by Costa [1], is the
capacity-optimal precoding approach for downlink multi-user
multiple-input multiple-output (MU-MIMO) communication systems. DPC
schemes use the channel state information (CSI), available at the
transmitter by means of feedback, and the a priori known users' data to
eliminate or reduce the co-channel interference (CCI). Therefore, the
number of available transmit antennas at the base station (BS) should
be larger than or equal to the total number of receive antennas at the
mobile stations (MSs).

Linear precoding schemes have low computational complexity, but they
tend to amplify the noise or to introduce inter-user interference [?].
Tomlinson-Harashima precoding (THP) is a nonlinear scheme that
overcomes the noise amplification by introducing the modulo (MOD)
operation, which reduces the required transmit power [?], [?]. The
error performance of the aforementioned precoding schemes still falls
far short of the requirements of future generations of mobile
communication systems.

Transmit antenna selection remarkably improves the system 
performance by exploiting the spatial selectivity. Therefore, 
when the number of antennas available at the BS is larger 
than the number of RF chains, the subset of antennas with the 
best channel conditions can be selected and switched to the 
RF chains. 

This work was supported by the Korea Science and Engineering Foundation 
(KOSEF) grant funded by the Korea government (MOST) (No. R01-2008- 
000-20333-0). 



Antenna selection has been widely studied for single-user MIMO
multiplexing systems ([?]-[?] and references therein), but fewer
results have been obtained for MU-MIMO systems [8]. In this paper, we
address transmit antenna selection for MU-MIMO systems with precoding.

Contributions. Our original contributions in this paper are as
follows.

• We introduce the optimal selection algorithm for DPC MU-MIMO
systems and a suboptimal reduced-complexity algorithm that achieves
quasi-optimal performance.

• We investigate the reason behind the sub-optimality of
QR-decomposition-based (QRD-based) antenna selection. Then, we
introduce a reduced-complexity, low-latency QRD-based antenna
selection algorithm that achieves optimum diversity and lags the
optimum performance by less than 1 dB and 0.1 dB in the case of linear
precoding and THP, respectively.

• Finally, we derive explicit formulas for the computational
complexities of the introduced algorithms in terms of the number of
required complex operations.

The remainder of this paper is organized as follows. Section II
introduces the system model and reviews the precoding schemes. In
Section III, we introduce the optimum SNR-based selection algorithm and
its reduced-complexity version. In Section IV, QRD-based antenna
selection is investigated, and the proposed maxR selection algorithm is
introduced. The complexities of the antenna selection algorithms are
derived in Section V, simulation results are shown in Section VI, and
conclusions are drawn in Section VII.

II. System Model and Review of Precoding 
Techniques 

A. System Model 

We consider a downlink (DL) MU-MIMO system, where a BS communicates
simultaneously with K non-cooperative MSs. The BS is equipped with M RF
chains and N > M antennas, and each MS has a single antenna. In this
paper, the number of RF chains is assumed to be equal to the number of
MSs, i.e., M = K, and the best M out of N antennas are selected for the
DL communication with the MSs. Under the assumption of a narrow-band
flat-fading channel, the MU-MIMO system can be modeled as follows:

$$\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{n}, \tag{1}$$

where $\mathbf{y} \in \mathbb{C}^{M}$ is the vector whose k-th element
is the received signal at the k-th MS, $\mathbf{x} \in \mathbb{C}^{M}$
is the precoded transmitted vector, and $\mathbf{n} \in \mathbb{C}^{M}$
is the additive white Gaussian noise with zero mean and variance
$\sigma_n^2$. Finally, $\mathbf{H}$ is the channel matrix whose element
$h_{k,i}$ is the transfer function between the i-th transmit antenna
and the single antenna of the k-th MS. The elements of $\mathbf{H}$ are
independent and follow complex Gaussian distributions.

Fig. 1. Block diagram of Tomlinson-Harashima precoding.

B. Linear Precoding Schemes 

Linear zero-forcing (LZF) precoding cancels the effect of the channel
by precoding the transmitted data vector $\mathbf{s}$ using the
pseudo-inverse of the channel matrix:

$$\mathbf{x} = \frac{1}{\sqrt{\gamma}} \mathbf{H}^{H} (\mathbf{H}\mathbf{H}^{H})^{-1} \mathbf{s}, \tag{2}$$

where the scaling factor $\gamma$ is present to fix the expected total
transmit power to $P_T$, that is,

$$\gamma = \frac{\mathrm{Tr}\{(\mathbf{H}\mathbf{H}^{H})^{-1}\}}{P_T}, \tag{3}$$

where $\mathrm{Tr}(\cdot)$ refers to the trace operation. As a
consequence, the receive SNR at any MS is given by:

$$\mathrm{SNR} = \frac{E(ss^{*})}{\gamma \sigma_n^2}. \tag{4}$$
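As an illustration of the scaling factor, the following minimal Python sketch applies pseudo-inverse precoding to a random channel. It assumes NumPy, unit-power data symbols, and the normalization $\gamma = \mathrm{Tr}\{(\mathbf{H}\mathbf{H}^{H})^{-1}\}/P_T$; the function name `lzf_precode` and the test channel are hypothetical and serve only this example.

```python
import numpy as np

def lzf_precode(H, s, P_T=1.0):
    """Zero-forcing precoding with a common power-normalizing factor gamma.

    H: (K, M) complex channel matrix; s: (K,) data symbols with unit power.
    The normalization below is an assumption made for this sketch.
    """
    gram_inv = np.linalg.inv(H @ H.conj().T)          # (H H^H)^{-1}
    gamma = np.real(np.trace(gram_inv)) / P_T          # scaling factor
    x = H.conj().T @ gram_inv @ s / np.sqrt(gamma)     # pseudo-inverse precoding
    return x, gamma

rng = np.random.default_rng(0)
K = M = 4
H = (rng.standard_normal((K, M)) + 1j * rng.standard_normal((K, M))) / np.sqrt(2)
s = np.exp(1j * (np.pi / 4) * (2 * rng.integers(0, 4, size=K) + 1))  # unit-power QPSK
x, gamma = lzf_precode(H, s)

# Each MS sees its own symbol scaled by 1/sqrt(gamma), without inter-user interference.
assert np.allclose(H @ x, s / np.sqrt(gamma))
```

A large `gamma` (ill-conditioned channel) shrinks the useful signal at every MS by the same amount, which is why the selection criteria that follow aim at a small scaling factor.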



If the channel matrix is ill-conditioned, $\gamma$ becomes large and
consequently the post-processing signal-to-noise ratio (SNR) decreases.
To overcome this drawback, linear minimum mean-square error (LMMSE)
precoding can be used to regularize the channel matrix. The precoded
signal using LMMSE is given by:

$$\mathbf{x} = \frac{1}{\sqrt{\gamma}} \mathbf{H}^{H} (\mathbf{H}\mathbf{H}^{H} + \alpha \mathbf{I}_{K})^{-1} \mathbf{s}, \tag{5}$$

where $\gamma$ again fixes the total transmit power and
$\alpha = K\sigma_n^2 / P_T$ is the regularization factor.



C. Tomlinson-Harashima Precoding (THP) 

In the case of an ill-conditioned channel matrix, linear precoding
leads to degradation in the receive SNR. To overcome this problem, THP
includes the non-linear modulo (MOD) operation, as shown in Fig. 1. The
modulo operation is defined by:

$$\mathrm{MOD}_{\tau}(b) = b - \tau \left\lfloor \frac{b}{\tau} + \frac{1 + j}{2} \right\rfloor, \tag{6}$$

where $\tau$ is chosen depending on the modulation constellation, and
$\lfloor b \rfloor$ rounds the real and imaginary parts of b to the
closest lower integers. The feedforward ($\mathbf{F}$) and feedback
($\mathbf{B}$) matrices are obtained using the QR-decomposition of the
Hermitian transpose of $\mathbf{H}$. Let
$\mathbf{H}^{H} = \mathbf{Q}\mathbf{R}$; then
$\mathbf{F} = \mathbf{Q}^{H}$ and $\mathbf{B} = \mathbf{G}\mathbf{R}^{H}$,
where $\mathbf{G} = \mathrm{diag}(1/R_{1,1}, 1/R_{2,2}, \ldots, 1/R_{M,M})$
is a diagonal matrix whose elements are used as scaling factors at the
receivers.
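The interplay of the modulo operation, the feedback matrix, and the receiver scaling can be checked numerically with the following sketch. It is a noise-free illustration under stated assumptions, not the authors' implementation: NumPy is assumed, the modulo is taken as the standard symmetric complex modulo with period `tau`, the feedforward filter is applied as a multiplication by $\mathbf{Q}$ at the transmitter, and the receiver scaling uses the conjugated diagonal of $\mathbf{R}$ so that the feedback matrix has a unit diagonal whatever phase the QR routine returns. The names `mod_tau` and `zf_thp` are hypothetical.

```python
import numpy as np

def mod_tau(b, tau):
    """Symmetric modulo applied separately to real and imaginary parts."""
    b = np.asarray(b, dtype=complex)
    return b - tau * (np.floor(b.real / tau + 0.5) + 1j * np.floor(b.imag / tau + 0.5))

def zf_thp(H, s, tau):
    """Zero-forcing THP for a square channel H (K x K) and symbols s."""
    Q, R = np.linalg.qr(H.conj().T)          # H^H = Q R, R upper triangular
    g = 1.0 / np.diag(R).conj()              # receiver scaling factors (diagonal of G)
    B = g[:, None] * R.conj().T              # B = G R^H, unit lower triangular
    v = np.zeros(len(s), dtype=complex)
    for k in range(len(s)):
        # Subtract the interference of already precoded users, then fold with MOD.
        v[k] = mod_tau(s[k] - B[k, :k] @ v[:k], tau)
    return Q @ v, g                          # transmitted vector and receiver scalings

rng = np.random.default_rng(0)
H = (rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))) / np.sqrt(2)
s = rng.choice([-1.0, 1.0], 4) + 1j * rng.choice([-1.0, 1.0], 4)   # 4-QAM symbols
x, g = zf_thp(H, s, tau=4.0)

# Noise-free reception: scale by g and apply the same modulo at each MS.
s_hat = mod_tau(g * (H @ x), tau=4.0)
assert np.allclose(s_hat, s)
```

Here `tau = 4` matches a 4-QAM constellation with points at ±1 ± j; a different constellation would need a different period.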

A better performance can be achieved when the channel matrix is
regularized as follows:

$$\bar{\mathbf{H}} = [\mathbf{H} \;\; \sqrt{\alpha}\,\mathbf{I}_{M}] \in \mathbb{C}^{M \times (N+M)}. \tag{7}$$

Then the QRD of the matrix $\bar{\mathbf{H}}$ is applied to obtain the
feedforward and feedback matrices.

Due to space limitations, only the LZF and ZF-THP algorithms are
considered throughout this paper.

D. Achievable Sum Rates by Precoding Schemes 

For linear precoding, the post-processing signal-to-noise ratio
($\mathrm{SNR}_p$) is inversely proportional to the scaling factor
$\gamma$, i.e., $\mathrm{SNR}_p = \rho/\gamma$, where
$\rho = E(s^{*}s)/\sigma_n^2$. Therefore, the sum rate capacity of
linear ZF precoding is given by:

$$C_{\mathrm{zf}} = K \cdot E[\log_2(1 + \rho/\gamma)], \tag{8}$$

where the maximum achievable sum rate of $K \cdot \log_2(1 + \rho)$ is
attained when the antennas are perfectly uncorrelated. On the other
hand, the sum rate for ZF-THP is given by:

$$C_{\mathrm{zf\text{-}THP}} = \sum_{i=1}^{K} \log_2(1 + \rho R_{i,i}^2). \tag{9}$$

III. SNR-based Antenna Selection 

A. SNR-based Optimum Antenna Selection 

When linear precoding algorithms are employed, the scaling factor
$\gamma$ is the same for all users. Thus, to maximize the
post-processing SNR, the optimal antenna subset is the one that
minimizes $\gamma$:

$$\mathcal{A}_{\mathrm{opt}} = \arg\min_{\mathcal{A}_p \subset \mathcal{S},\; p = 1, \ldots, P} \gamma\{\mathbf{H}_p\}, \tag{10}$$

where $P = C_N^M$, $\mathbf{H}_p$ is the sub-matrix of $\mathbf{H}$
formed by the columns indexed by $\mathcal{A}_p$,
$\gamma\{\mathbf{H}_p\} = \mathrm{Tr}\{(\mathbf{H}_p \mathbf{H}_p^{H})^{-1}\}$,
and $P_T = 1$, i.e., $\mathrm{SNR} = 1/\sigma_n^2$. Although this
algorithm is optimized for linear precoding schemes, it is also
performance-optimum for successive interference cancellation because
the best-conditioned matrix is selected.

The optimum antenna subset $\mathcal{A}_{\mathrm{opt}}$ is obtained by
an exhaustive search over the $C_N^M$ possible subsets.
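The exhaustive search can be sketched in a few lines of Python. This is an illustrative sketch assuming NumPy, $P_T = 1$, and a channel matrix with one column per transmit antenna; the names `scaling_factor` and `select_optimum` are hypothetical.

```python
import itertools
import numpy as np

def scaling_factor(Hp):
    """gamma{H_p} = Tr{(H_p H_p^H)^{-1}} for P_T = 1."""
    return np.real(np.trace(np.linalg.inv(Hp @ Hp.conj().T)))

def select_optimum(H, M):
    """Search all M-antenna subsets (columns of H) for the smallest gamma."""
    N = H.shape[1]
    best = min(itertools.combinations(range(N), M),
               key=lambda A: scaling_factor(H[:, list(A)]))
    return list(best), scaling_factor(H[:, list(best)])

rng = np.random.default_rng(0)
K, N = 4, 8
H = (rng.standard_normal((K, N)) + 1j * rng.standard_normal((K, N))) / np.sqrt(2)
subset, gamma_opt = select_optimum(H, M=K)

# The selected subset is never worse than simply taking the first M antennas.
assert gamma_opt <= scaling_factor(H[:, :K]) + 1e-12
```

For N = 8 and M = 4 the search evaluates 70 sub-matrices, each requiring a matrix inversion, which motivates the reduced-complexity alternatives.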

B. SNR-based Reduced-Complexity (RC) Antenna Selection 

To reduce the complexity of the SNR-based optimum antenna selection, we
propose to successively remove antennas one by one until the number of
remaining antennas equals M. This strategy is adapted from [8], where
antenna selection algorithms are investigated for MU-MIMO with block
diagonalization. At each iteration, the antenna whose deactivation
maximizes the post-processing SNR, i.e., minimizes $\gamma$, is
removed. This procedure is repeated until the number of remaining
antennas is equal to M. The reduced-complexity antenna selection is
fully described in Fig. 2. Note that
$\mathbf{H}_p = \mathbf{H}(:, \mathcal{A}_{rc})$ is the sub-matrix of
$\mathbf{H}$ with elements from rows 1 to M and column indexes in
$\mathcal{A}_{rc}$.

    Input: H, N, M
    U = N, V = N - M, A_rc = S
    for i = 1 to V do
        for k = 1 to U do
            H_s = H with the k-th column removed
            gamma_k = Tr{(H_s H_s^H)^{-1}}
        end for
        k_i = arg min_{k = 1, ..., U} (gamma_k)
        U = U - 1
        A_rc = A_rc \ {k_i}                      {deactivate antenna k_i}
        H = H with the k_i-th column removed
    end for
    H_p = H(:, A_rc)
    Output: H_p, A_rc

Fig. 2. SNR-based reduced-complexity antenna selection algorithm.

Therefore, to obtain the subset $\mathcal{A}_{rc}$, only
$\sum_{U=M+1}^{N} U$ partial antenna combinations are to be searched
over instead of the $C_N^M$ possible combinations.
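The successive removal strategy can be sketched as follows. This is a minimal illustration assuming NumPy and $P_T = 1$; unlike a column-deleting formulation, it keeps the original antenna indices in a list so that the returned subset directly indexes the columns of the full channel matrix. The names `scaling_factor` and `select_rc` are hypothetical.

```python
import numpy as np

def scaling_factor(Hp):
    """gamma{H_p} = Tr{(H_p H_p^H)^{-1}} for P_T = 1."""
    return np.real(np.trace(np.linalg.inv(Hp @ Hp.conj().T)))

def select_rc(H, M):
    """Greedy selection: repeatedly deactivate the antenna whose removal
    leaves the smallest scaling factor, until M antennas remain."""
    active = list(range(H.shape[1]))
    while len(active) > M:
        # gamma obtained when each active antenna is tentatively removed
        costs = [scaling_factor(H[:, [a for a in active if a != k]])
                 for k in active]
        active.pop(int(np.argmin(costs)))
    return active

rng = np.random.default_rng(0)
K, N = 4, 8
H = (rng.standard_normal((K, N)) + 1j * rng.standard_normal((K, N))) / np.sqrt(2)
A_rc = select_rc(H, M=K)
H_p = H[:, A_rc]            # selected sub-channel, one column per active antenna
assert H_p.shape == (K, K)
```

With N = 8 and M = 4, the loop evaluates 8 + 7 + 6 + 5 = 26 candidate sub-matrices.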

IV. QRD-based Antenna Selection 

The QR-decomposition (QRD) of the channel matrix has been used in the
literature to select a good, but not necessarily the best, subset of
the transmit antennas in spatial multiplexing systems. Although the
performance of a MIMO system employing QRD-based antenna selection is
not optimum, it is attractive due to its low computational complexity.

In light of the QRD of the channel matrix, the scaling factor is given
as a function of the matrix $\mathbf{R}$ as follows:

$$\gamma = \mathrm{Tr}\{(\mathbf{H}\mathbf{H}^{H})^{-1}\} = \mathrm{Tr}\{(\mathbf{R}^{H}\mathbf{R})^{-1}\}. \tag{11}$$

Let $\mathbf{A} = \mathbf{R}^{-1}$; then, due to the triangular
structure of $\mathbf{R}$,

$$\gamma = \sum_{i=1}^{M} \frac{1}{R_{i,i}^2} + \sum_{i=1}^{M} \sum_{j=i+1}^{M} |A_{i,j}|^2 \;\geq\; \sum_{i=1}^{M} \frac{1}{R_{i,i}^2}, \tag{12}$$

where $R_{i,i}$ are the entries on the diagonal of $\mathbf{R}$. The
equality in (12) is satisfied iff the rows of $\mathbf{H}$ are mutually
orthogonal, i.e., $\mathbf{R}$ is diagonal.

Therefore, a sub-optimum antenna subset can be obtained by minimizing
$\sum_{i=1}^{M} 1/R_{i,i}^2$. This can be done by maximizing the
diagonal elements of $\mathbf{R}$. QRD-based antenna selection
algorithms are therefore not optimum, because they discard the second
summation of (12). In the following, we introduce two QRD-based antenna
selection algorithms.
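The split of the scaling factor into a diagonal part and an off-diagonal part of $\mathbf{R}^{-1}$ can be verified numerically. The sketch below assumes NumPy, $P_T = 1$, and a square channel with $\mathbf{H}^{H} = \mathbf{Q}\mathbf{R}$; the function name `gamma_terms` is hypothetical.

```python
import numpy as np

def gamma_terms(H):
    """Return (gamma, diagonal term, off-diagonal term) for a square channel H."""
    Q, R = np.linalg.qr(H.conj().T)                  # H^H = Q R, R upper triangular
    A = np.linalg.inv(R)                             # A = R^{-1}, also upper triangular
    gamma = np.real(np.trace(np.linalg.inv(H @ H.conj().T)))
    diag_term = np.sum(1.0 / np.abs(np.diag(R)) ** 2)      # sum_i 1 / R_ii^2
    offdiag_term = np.sum(np.abs(np.triu(A, k=1)) ** 2)    # sum_{i<j} |A_ij|^2
    return gamma, diag_term, offdiag_term

rng = np.random.default_rng(1)
H = (rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))) / np.sqrt(2)
gamma, diag_term, offdiag_term = gamma_terms(H)

assert np.isclose(gamma, diag_term + offdiag_term)   # exact decomposition
assert gamma >= diag_term                            # bound used by QRD-based selection
```

The size of `offdiag_term` relative to `diag_term` indicates how much a selection rule based only on the diagonal of $\mathbf{R}$ ignores for a given channel.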

A. Antenna Selection using a Single QRD (Single-QR) 

In the literature, the single-QR antenna selection algorithm has been
used for spatial multiplexing systems that employ successive
interference cancellation detection. In this paper, we employ the
single-QR algorithm for transmit antenna selection in MU-MIMO systems
with precoding. The main idea behind the single-QR algorithm is to
maximize $\sum_{i=1}^{M} R_{i,i}^2$ by using only one QR decomposition
of the channel matrix. At the first iteration, the column of the
channel matrix with the largest power is selected. The remaining
$(N-1)$ columns are then orthogonalized, and the orthogonalized column
with the largest power is considered for the second iteration. This
process is repeated until M out of the N possible transmit antennas are
selected.

Although the single-QR algorithm has relatively low complexity, the
selection at the first iteration of the column with the largest power,
which may be highly correlated with other columns, can degrade its
performance compared to that of the optimum antenna selection
algorithm.
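The iterations described above amount to a Gram-Schmidt orthogonalization with column pivoting. The following sketch illustrates this under stated assumptions: NumPy is assumed, each column of `H` corresponds to one transmit antenna, and the name `select_single_qr` is hypothetical.

```python
import numpy as np

def select_single_qr(H, M):
    """Pick M columns of H by pivoted Gram-Schmidt (largest residual power first)."""
    Q = H.astype(complex)                       # working copy, orthogonalized in place
    norms = np.sum(np.abs(Q) ** 2, axis=0)      # residual power of every column
    remaining = list(range(Q.shape[1]))
    selected = []
    for _ in range(M):
        k = max(remaining, key=lambda c: norms[c])    # strongest residual column
        remaining.remove(k)
        selected.append(k)
        q = Q[:, k] / np.linalg.norm(Q[:, k])         # unit vector of the chosen column
        for m in remaining:
            r = np.vdot(q, Q[:, m])                   # projection coefficient q^H q_m
            Q[:, m] -= r * q                          # remove the component along q
            norms[m] -= np.abs(r) ** 2                # update residual power
    return selected

rng = np.random.default_rng(0)
H = (rng.standard_normal((4, 8)) + 1j * rng.standard_normal((4, 8))) / np.sqrt(2)
selected = select_single_qr(H, M=4)

# The first pick is always the column with the largest power.
assert selected[0] == int(np.argmax(np.sum(np.abs(H) ** 2, axis=0)))
```

Because the first pick is fixed by the column powers alone, a strong column that is highly correlated with the others is still chosen, which is the weakness noted above.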

In the following subsection, we introduce a QRD-based antenna selection
algorithm that is efficient in both performance and complexity.

B. Maximum-R (maxR) Antenna Selection Algorithm 

To ensure obtaining the best antenna subset among the $C_N^M$
combinations, an exhaustive search should be employed to obtain the
subset with minimum $\sum_{i=1}^{M} 1/R_{i,i}^2$. Although this search
can be done successively, so that unnecessary computations are avoided,
its worst-case complexity is still prohibitive.

Noting that the problem of the single-QR algorithm lies in the
selection of the first antenna, we consider all antennas equally as
candidates for the first iteration, regardless of their condition.
Thus, at first, the norms of the columns of the channel matrix are
calculated, and their descending order w is obtained such that

$$\|\mathbf{h}_{w_1}\|^2 \geq \|\mathbf{h}_{w_2}\|^2 \geq \cdots \geq \|\mathbf{h}_{w_N}\|^2.$$

At the first stage of the proposed maxR algorithm, the best metric bm
is initialized to $\infty$, and the column with the largest power is
considered at the first QRD iteration, leading to a reduction in
$1/R_{1,1}^2$. The remaining $(N-1)$ columns of the matrix $\mathbf{Q}$
are then orthogonalized, and the one with the largest power is selected
for the second iteration. This process is repeated until the first
antenna subset of M elements is obtained. Also, the metric
$D = \sum_{i=1}^{M} 1/R_{i,i}^2$ is calculated successively and
assigned to bm. At the second stage of the proposed algorithm, the
$w_2$-th column of the matrix $\mathbf{H}$ is selected at the first
iteration of the QRD, the remaining columns are orthogonalized, and the
corresponding $D = 1/R_{1,1}^2$ is calculated. At any iteration, if D
exceeds the already-found best metric bm, the stage is stopped and the
algorithm moves to the next stage. Otherwise, the algorithm proceeds
until the last iteration, where D is updated successively; a stage that
completes all M iterations replaces bm and the selected subset.
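The staged search with early termination can be sketched as follows. This is an illustrative sequential version under stated assumptions, not the authors' implementation: NumPy is assumed, columns of `H` correspond to transmit antennas, and the name `select_max_r` is hypothetical.

```python
import numpy as np

def select_max_r(H, M):
    """Run one pivoted Gram-Schmidt stage per candidate first antenna and keep
    the subset with the smallest metric D = sum_i 1 / R_ii^2."""
    H = np.asarray(H, dtype=complex)
    N = H.shape[1]
    norms = np.sum(np.abs(H) ** 2, axis=0)
    order = np.argsort(-norms)                       # descending column powers
    best_metric, best_subset = np.inf, None
    for first in order:                              # one stage per first antenna
        Q, tnorms = H.copy(), norms.copy()
        remaining, subset, D = list(range(N)), [], 0.0
        for i in range(M):
            k = int(first) if i == 0 else max(remaining, key=lambda c: tnorms[c])
            D += 1.0 / tnorms[k]                     # R_ii^2 is the residual power
            if D > best_metric:                      # stage cannot improve: stop early
                break
            remaining.remove(k)
            subset.append(k)
            q = Q[:, k] / np.linalg.norm(Q[:, k])
            for m in remaining:
                r = np.vdot(q, Q[:, m])
                Q[:, m] -= r * q
                tnorms[m] -= np.abs(r) ** 2
        else:                                        # stage completed all M iterations
            best_metric, best_subset = D, subset
    return best_subset, best_metric

rng = np.random.default_rng(0)
H = (rng.standard_normal((4, 8)) + 1j * rng.standard_normal((4, 8))) / np.sqrt(2)
subset, metric = select_max_r(H, M=4)
assert len(subset) == 4 and np.isfinite(metric)
```

The stages are independent of each other apart from the shared best metric, so they could also be run in parallel at the cost of losing some early terminations.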

Fig. 3 gives the detailed description of the proposed maxR transmit
antenna selection algorithm. Note that $\mathbf{0}_n$ is the n x n
matrix whose elements are all zeros, and
$\mathcal{S} = \{1, 2, \ldots, N\}$ is the set of all available
antennas.

The proposed maxR algorithm is easily pipelined due to its parallel
nature, where pipelining leads to a tremendous reduction in latency,
particularly for large N.

    Input: H, N, M
    R = 0_M, Q = H, bm = infinity, A_maxR = S
    for i = 1 to N do
        norms_i = ||h_i||^2
    end for
    w = sort(norms, descend)
    for j = 1 to N do
        Tnorms = norms, R = 0_M
        Q = H, D = 0, V = S
        for i = 1 to M do
            if i = 1 then
                k = w_j
            else
                k = arg max_{m = i, ..., N} (Tnorms_m)
            end if
            D = D + 1/Tnorms_k
            if D > bm then
                break
            end if
            Exchange columns i and k in Q, R, Tnorms and V
            R_{i,i} = ||q_i||
            q_i = q_i / R_{i,i}
            for m = i + 1 to N do
                R_{i,m} = q_i^H q_m
                q_m = q_m - R_{i,m} q_i
                Tnorms_m = Tnorms_m - |R_{i,m}|^2
            end for
            if i = M then
                A_maxR = V(1 : M)
                bm = D
            end if
        end for
    end for
    H_p = H(:, A_maxR)
    Output: H_p, A_maxR

Fig. 3. Maximum-R (maxR) antenna selection algorithm.

TABLE I
Computational complexity of the transmit antenna selection schemes.

Scheme | Complexity (flops) | C/C_opt x 100 (N = 8, M = 4) | C/C_opt x 100 (N = 14, M = 10)
Optimum | $C_N^M \cdot (10M^3 - [?]M^2 + [?]M - 1)$ | 100 | 100
RC | $M^2([?]N^3 + 4MN^2 + [?]M + 4MN - [?]M^3 - [?]M^2 - [?]N^2 - [?]N + [?]) + M([?]N^3 + [?]N^2 + [?]N + 1)$ | 41.1 | 5.2
maxR | $8N^2M^2 + [?]N^2 + [?]N^2M + [?]NM - 4NM^3 - [?]NM^2 - [?]N$ | 13.9 | 1.0
singleQR | $8NM^2 + [?]NM - 4M^3 - [?]M^2 - [?]M - [?]N$ | 1.8 | 0.07

V. Computational Complexity of the Transmit 
Antenna Selection Algorithms 

The computational complexities of the introduced transmit antenna
selection algorithms are derived in this section. The addition and
multiplication operations are counted as one flop and three flops,
respectively. Also, the matrix inversion is obtained using Gauss-Jordan
row operations [?]. In Table I, the computational complexities of the
aforementioned algorithms are given for K = M. Moreover, two numerical
examples for (N = 8, M = 4) and (N = 14, M = 10) are given, where
$C/C_{\mathrm{opt}}$ is the ratio between the complexity of the antenna
selection algorithm and that of the optimum algorithm. The single-QR
algorithm requires the least computational effort among all the
presented algorithms. The maximum computational complexity of the
proposed maxR algorithm is about 1% of that of the optimum algorithm
for N = 14 and M = 10. This maximum complexity is required when the
algorithm is processed in parallel to reduce the latency, whereas the
complexity is reduced when the algorithm is processed sequentially.
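To give a feeling for the size of the search spaces behind these ratios, the short sketch below counts how many candidate sub-matrices the exhaustive and the successive-removal (RC) searches evaluate for the two numerical examples. It only counts candidates, not flops, and the function names are hypothetical.

```python
from math import comb

def candidates_exhaustive(N, M):
    """Number of M-antenna subsets examined by the exhaustive search."""
    return comb(N, M)

def candidates_rc(N, M):
    """One candidate per active antenna in each of the N - M removal rounds."""
    return sum(range(M + 1, N + 1))

for N, M in [(8, 4), (14, 10)]:
    print(N, M, candidates_exhaustive(N, M), candidates_rc(N, M))
# 8 4 70 26
# 14 10 1001 50
```

The gap between the two counts widens quickly with N, in line with the falling complexity ratios of the sub-optimal schemes in the second numerical example.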

VI. Simulation Results and Discussions 

In this section, we evaluate the bit error rate (BER) performance of
the introduced antenna selection algorithms for the LZF and THP-ZF
precoding schemes using 16-QAM. The channel is assumed to be perfectly
known at the transmitter. We consider 4 non-cooperative MSs, each
equipped with a single receive antenna, and a single BS equipped with
N = 5, 6, or 8 transmit antennas, from which only 4 are selected and
switched to the 4 available RF chains.

Fig. 4 depicts the BER performance of LZF precoding for various
transmit antenna selection algorithms. For N = 8, i.e., 4 additional
antennas, and at a BER of $10^{-4}$, degradations of 0.25 dB, 0.8 dB,
and 1.8 dB are observed when the RC, maxR, and single-QR antenna
selection algorithms are used, respectively, compared to the optimum
performance.

Fig. 4. Performance of the LZF precoding for several transmit antenna
selection algorithms for M = K = 4, and N = 5, 6, and 8.

Fig. 5 depicts the BER performance of the THP-ZF scheme for various
transmit antenna selection algorithms. For N = 8 and at a target BER of
$10^{-4}$, the RC, maxR, and single-QR antenna selection algorithms
lead to degradations of 0.05 dB, 0.09 dB, and 0.5 dB, respectively,
compared to the optimum performance.

Fig. 5. Performance of THP-ZF for several transmit antenna selection
algorithms for M = K = 4, and N = 5, 6, and 8.

From Figs. 4 and 5, we conclude that the linear precoding scheme is
more sensitive to the antenna selection scheme than the THP algorithm,
for which quasi-optimum performance is achieved using sub-optimal
antenna selection algorithms. Moreover, THP-ZF outperforms LZF
precoding for all antenna configurations and for the different antenna
selection algorithms. For instance, for N = 8 and at a BER of
$10^{-5}$, THP-ZF outperforms LZF by 2.65 dB.

Fig. 6 shows the achieved sum rates for both the LZF and THP-ZF
precoding schemes for N = 6 and N = 12, i.e., 2 and 8 additional
antennas, respectively. The sum rate achieved by LZF precoding
increases markedly for the first few additional antennas. This is due
to its sensitivity to the factor $\gamma$, which is reduced as the
number of additional antennas is increased. For a large number of
additional antennas, $E[\gamma] \approx 1$ and the upper bound on the
capacity can be achieved. At an SNR of 30 dB, the SNR-optimum algorithm
achieves the best sum rate, while the RC, the proposed maxR, and the
single-QR algorithms lag the optimum capacity by 0.045, 0.07, and 0.2
bits/s/Hz/user, respectively.

Fig. 6. Achieved sum rates by LZF (left) and THP-ZF (right) for N = 6
and N = 12.

In the case of THP-ZF, the proposed maxR algorithm achieves the best
sum rate, followed by single-QR and the optimum scheme, while the RC
algorithm achieves the lowest sum rate capacity. This is because the
sum rate of THP-ZF is directly affected by the diagonal elements of the
matrix $\mathbf{R}$, which are maximized in both the single-QR and the
proposed maxR algorithms.

VII. Conclusions

In this paper, we introduced four transmit antenna selection algorithms
for downlink MU-MIMO systems, where each user is equipped with a single
receive antenna. The optimal and a greedy suboptimal transmit antenna
selection algorithms were presented. Furthermore, the sub-optimality of
QRD-based antenna selection was investigated; we then introduced the
conventional QRD-based algorithm and proposed an efficient QRD-based
transmit antenna selection scheme (maxR), which is efficient in both
implementation and performance. Simulation results show that the
proposed maxR antenna selection algorithm performs close to the optimal
algorithm while requiring only a small fraction of its computational
complexity.